Simulating longer vectors of correlated binary random variables via multinomial sampling
نویسنده
چکیده
The ability to simulate correlated binary data is important for sample size calculation and comparison of methods for analysis of clustered and longitudinal data with dichotomous outcomes. One available approach for simulating length n vectors of dichotomous random variables is to sample from the multinomial distribution of all possible length n permutations of zeros and ones. However, the multinomial sampling method has only been implemented in general form (without first making restrictive assumptions) for vectors of length 2 and 3, because specifying the multinomial distribution is very challenging for longer vectors. I overcome this difficulty by presenting an algorithm for simulating correlated binary data via multinomial sampling that can be easily applied to directly compute the multinomial distribution for any n. I demonstrate the approach to simulate vectors of length 4 and 8 in an assessment of power during the planning phases of a study and to assess the choice of working correlation structure in an analysis with generalized estimating equations.
منابع مشابه
Structural Biology, Biochemistry, and Biophysics BIOPHYSICAL CHARACTERIZATION OF NATURALLY OCCURING TITIN
SIMULATING DEPENDENT BINARY DATA WITH RANDOM EFFECTS. Aobo Wang, Roy T. Sabo, Department of Biostatistics, Virginia Commonwealth University, Richmond, Virginia 23298-0032. Dependent binary data can be simply simulated using the multinomial sampling method. We extend this method to simulate dependent binary data with clustered random effect structures. Several distributions are considered for co...
متن کاملMonitoring Multinomial Logit Profiles via Log-Linear Models (Quality Engineering Conference Paper)
In certain statistical process control applications, quality of a process or product can be characterized by a function commonly referred to as profile. Some of the potential applications of profile monitoring are cases where quality characteristic of interest is modelled using binary,multinomial or ordinal variables. In this paper, profiles with multinomial response are studied. For this purpo...
متن کاملMaxima of the Cells of an Equiprobable Multinomial
Consider a sequence of multinomial random vectors with increasing number of equiprobable cells. We show that if the number of trials increases fast enough, the sequence of maxima of the cells after a suitable centering and scaling converges to the Gumbel distribution. While results are available for maxima of triangular arrays of independent random variables with certain types of distribution, ...
متن کاملrTableICC: An R Package for Random Generation of 2×2×K and R×C Contingency Tables
In this paper, we describe the R package rTableICC that provides an interface for random generation of 2×2×K and R×C contingency tables constructed over either intraclass-correlated or uncorrelated individuals. Intraclass correlations arise in studies where sampling units include more than one individual and these individuals are correlated. The package implements random generation of contingen...
متن کاملGenerating Spatial Correlated Binary Data Through a Copulas Method
Simulating spatial correlated binary data is very important on many cases, but it is not easily to accomplish, as there are restrictions on the parameters of Bernoulli variables. This paper develops a copulas method to generate spatial correlated binary data. The spatial binary data generated by this method has an inverse spatial pattern comparing with the latent Gaussian random field data, how...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 114 شماره
صفحات -
تاریخ انتشار 2017